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The proposed system also addresses the overfitting to the train samples 
problem of a feature extraction algorithm, named 2-Dimensional Linear 
Keywords: Discriminant analysis (2D-LDA). The samples in the proposed method are 
bootstrapped and the 2D-LDA performed on each group during the offline 
phase. While in the online phase, the tested class will be transformed into 
: subspaces using different eigenvectors that obtained from different samplings, 
Palmprint and the results matched with templates in the corresponding subspace. To 
evaluate the proposed method, two palmprint databases are used which are IIT 
Delhi Touchless Palmprint Database and PolyU palmprint database, and 
different rank-level fusion algorithms are investigated. The results of the 
proposed method show improvement in the identification rate. 
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1. INTRODUCTION 

Human’s physical and behavioral characteristics differ from an individual to another which give us the 
ability to distinguish a person from another. Nevertheless, this natural and accurate human skill is affected by 
memory limits which mean its inapplicable to memorize a large database of distinctive biometric features from 
palmprint or iris for example. Automatic biometric recognition systems are devised to overcome such 
problem [1], [2]. The most common human characteristics that can be used in biometric recognition systems 
are physical characteristics such as the face, Iris, fingerprint, palmprint and hand geometry, or behavioral 
characteristics such as speech, gait, keystroke, and signature. Furthermore, these characteristics vary in 
different aspects: accuracy, universality, distinctiveness, permanence, collectability, and acceptability. Among 
these characteristics, palmprint show high distinctiveness and fair acceptability [1]. 

However, the implementation of single biometric characteristics in a unimodal system (e.g. single 
sample, single fingerprint) has a limitation in the recognition accuracy due to the limited amount of the 
extracted information which affect the system performance [3], [4]. Multibiometric recognition systems have 
been emerged to solve this issue in the unimodal systems. Multimodal systems can be formed using multiple 
sensors, multiple samples, multiple instances, or multiple algorithms [3]. The information from these sources 
can be fused at different stages of the biometric recognition system. The fusion can be done before the 
matching: at sensor-level [5], [6] or at feature-level [7]-[9], or after the matching: at score-level [10]-[12], rank- 
level [13]-[15] or at decision-level [16]-[19]. Nevertheless, the use of multiple sensors system raises the cost 
of the biometric recognition system [4]. The multi-algorithms approach is proposed to obtain extra information 
from the same biometric data [20], [21] which eliminate the need for the extra acquisition equipment. However, 
this approach consumes substantial time because different feature extraction algorithms need to be applied first 
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and different type of classifiers may be needed depending on the nature of the extracted features. To fix this 
issue, Yan et al. [4] proposed extracting the same feature (i.e. SIFT feature) from multiple samples of palm 
vein images and fuses them at the feature-level during the offline phase for robust identification. Leng et. al. 
[22] merged 2D-DCT features from left and right palm at feature-level. While at the score-level, Cheng et al. 
[23] fused the score of matching from the repeated scan of fingerprints (i.e., multi-sample) during the online 
phase. Kumar and Shekhar [14] argued that the rank-level fusion is preferable over other fusion levels for 
multibiometric identification due to firstly, elimination of feature or score normalization, secondly, provide 
device and algorithms independency, thirdly, the actual identification may fall in second or third rank especially 
in large database due to limitation in the feature extraction algorithm or/and classifier efficiency. 

There are two common algorithms for feature extraction, named Linear Discriminant Analysis (LDA) 
and Principal Component Analysis (PCA). These algorithms require the image to be in 1-dimensional form 
before being feed into the PCA or the LDA, which is memory expensive. These algorithms also prone to the 
small sample size (SSS) problem, which means many samples required to extract reliable representation. Yang 
et al. [24] and Li & Yuan [25] proposed using the original images without converting them into one vector, 
which is more efficient. Another approach was proposed to overcome the SSS problem and reduce the 
overfitting to the training set by Nguyen et al. [26] and Wang & Tang [27]. These algorithms propose selecting 
the significant eigenvectors and randomly select other less important eigenvectors to make several weak 
classifiers and combine them to make better classifier. The eigenvectors that correspond to big eigenvalues are 
selected because they are better in discriminating the classes, in the LDA algorithm, or better in representing 
the data, in the PCA algorithm. However, the random selection of eigenvectors may lead to the inclusion of a 
noise or insignificant data which result in degraded performance [28]. 

In the proposed method the eigenvectors will be selected according to their significance instead of the 
random selection. Moreover, to avoid overfitting, the samples are bootstrapped, and the 2D-LDA applied to 
the different formations of training samples and different eigenvectors extracted from each formation. Later, 
these eigenvectors will be used to transform another sample that dedicated for testing into different subspaces 
to generate templates and save them in the system database during offline phase. While in the online phase, 
these different eigenvectors will also be used to transform the tested palmprint image into different subspaces 
and match the results with the classes in the corresponding subspaces. The results from these matches will be 
fused to find the final palmprint identity at rank-level. The general outline of the proposed method presented 
in Figure 1. 
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Figure |. The proposed method 


The rest of the paper is organized as follow: sections 2 represent the proposed method; section 3 shows 
the result and the experiment setup while section 4 represents the conclusion. It’s should be mentioned that the 
Maximum Rank Method and the majority voting scheme isn’t implemented due to the generation of ties which 
randomly broken to produce the tested class identity [13]. 
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2. THE PROPOSED METHOD 
The first step in the proposed method is dividing the classes’ samples G,,...,G, into two groups: the 
train group G; and the test groupG,, the train group has c classes and n samples, G,.,,, while the test group 


has one sample Gr_,. During the offline phase, the 2D-LDA applied on G, to extract the eigenvectors and 


these eigenvectors used to reduce the dimensionality of G; and generate the templates. The selection of the 
samples to be included in G, and Gr will vary to generate k different formations, G in and Gi, 1» 4S shown in 


Figure 2. 
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Figure 2. The Different Formations of G, and Gr, k here is 7 


The proposed method is represented mathematically as follow: 
T 
Sh = Dir VE,,,e0,(Ghen — Mi) (GE, — Mi) (1) 


where Sy, is scatter matrix within classes, M; is the mean for each class, Gi. is 2-dimensional 
palmprint image. n is the number of the samples for each class in G,. 


Mi = ~YG, ned: Glen (2) 


ni Gien€Di 
Sk = Y_, N; (M; — M)(M; — M)* (3) 


where Sz is the scatter matrix between classes m is the overall mean, and m; and N; are the mean and 
size of the respective class i. The final scatter matrix is evaluated as in the equation: 


-1 
A= Ss" Se (4) 
Finally, the eigenvector and eigenvalue are calculated as 
Ate = wv" (5) 
Where v is 1 X b eigenvector and w is the eigenvalue. To create the new subspace, h eigenvectors 
with corresponding high eigenvalues are selected 1, ..., v,. and each vector is multiplied with original images 
from the test group to form a feature vector. 
k — yk y Ck = 
Ya. = vq * Gr, Fo ao eee (6) 
The templates are constructed from the acquired feature vectors as: 


ME I py (7) 


In the online phase, the tested class transformed into subspace using the v* to generate feature 
matrices MT". To determine the tested class identity, the Euclidean distance is calculated between the MT* 
and MLK, 
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De(MTE, MLE) = |¥%,(MTE — MLE,)- (8) 


Lastly, different fusion schemes can be implemented using D¥ to obtain accurate class identity. 


3. RESULTS AND EXPERIMENT SETUP 

To evaluate the proposed method, the experiments conducted on two palmprint databases which are 
the IIT Delhi Touchless Palmprint Database and PolyU palmprint database. Different number of samples of 
palmprint images included in G, as available in the corresponding database. The votes are obtained by order 
the values of D V k in ascending fashion. IIT Delhi Touchless Palmprint Database includes images from left 
and right hands that gathered from 460 palms in a peg free form. The region of interest (ROD cropped 
automatically and included in the database which has the size of 150*150 pixels [29]. For each palm, four 
samples out of five used to form four formations. The noise that presents in IIT Delhi palmprint database in 
the form of distortion and translation considered as a factor that degrades the recognition rate if a single sample 
single-instance is used. PolyU palmprint database includes 7,752 palmprint images belongs to 386 palms which 
gathered using charge-coupled device (CCD) camera-based device with the help of pegs to fix palm position 
which minimizes the spatial distortion between the images that belongs to the same palm. The images have 
been collected through out two sessions. To mimic the real-world-scenario, these two sessions separated by 
around two months. In these two sessions, most of the subjects provide around ten images for left and right 
palms separately [30]. To keep the uniformity between the classes, the palmprints that have at less seven 
samples included in our experiment which can be summed to 378 different palms. These palms are divided 
randomly into two categories to represent the left and right palms by which each category has 189 palms. The 
ROIs for this database obtained using the method proposed in [31]. 

The fusion at score-level is not applicable for left and right palmprints fusion because palmprints for 
the right and left palms that belong to the same subject are not identical. Hence it is inapplicable to fuse the 
matching score from corresponding palms without former knowledge of the correlation between them. Xu ET. 
al. [32] showed a correlation between the principal lines of the left and right palms that belongs to the same 
person. Nevertheless, their experiment showed the best results when assigning a small weight for the fused 
scores from cross palms (w < 0.1). 

The algorithms that implemented in this paper for rank fusion are Borda Count and Bucklin Majority 
Voting, both with different rank levels. In the Borda Count Method, scores are assigned to every identity that 
produced by each classifier. The identity at the first rank receives highest points, while identities at lower ranks 
receive lower points according to their level. Later, these points are summed and the identity with the highest 
points will be considered as genuine identity [13]. In Bucklin Majority Voting, the first rank will be used to 
extract the identity votes and the class with the majority of votes is declared as the genuine identity. If there is 
no majority produced, the votes from other ranks will be included in a steps fashion depend on the mentioned 
condition. Table 1 and Table 2 show the identification rate, the average identification rate is obtained from 
averaging the results from the unimodal model at different samples formations, these results obtained using 
2D-LDA [25]. The number of the used ranks is between parentheses, while the number of ranks has a positive 
impact on the identification rate (for both palms) of PolyU database it has the opposite on the IIT Delhi 
database. The reason for such effect is that IIT Delhi classes have a clear intraclass variation in form of rotation, 
perspective, or distance from the camera, which reduce the discrimination power and result in scattered genuine 
identities along the different ranks. Also, the inclusion of more ranks can degrade the performance and increase 
the confusion in the final decision. 


Table 1. Recognition Rate (RR) for IIT Delhi database 








Borda Borda Bucklin Bucklin Bucklin 
Number Average of Average of ; Borda Majority Majority Majority 
é ; Count/left Count/right : i ; 
of train left palm right palm ims RR alms RR Count/both Voting/ left Voting/ Voting/ both 
images RR (%) RR (%) Pa a P oh palms RR (%) palmsRR tight palms palms RR 
sia nid (%) RR(%)_——_(%) 
2 55.9 61.7 
2(1) 65.6 68.6 86 
2(3) 54.7 58.6 76.5 
2(6) 52.1 56.9 75.2 59.5 64.7 80 
3 56.9 62.8 
3(1) 66.5 67.3 84.7 
3(3) 56.9 62.1 79.5 
3(6) 55.2 60.8 77.3 61.3 65.21 82.1 
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Table 2. Recognition Rate (RR) for PolyU palmprint database 








Numb Average Average Borda Borda Borda ae apes oe 
Bees, of left of right Count/left Count/right Count/both agony Aor ajorty 
of train Voting/ left Voting/ Voting/ 
; palm RR palm RR palms RR palms RR palms RR F 
images (%) (%) (%) (%) (%) palms RR right palms both palms 
(%) RR (%) RR (%) 
2 95.2 95.4 
2(1) 97.8 97.3 98.4 
2(3) 97.8 97.3 100 
2(6) 96.8 97.3 100 97.3 97.8 98.9 
3 95.1 95.2 
3(1) 97.8 97.3 98.4 
3(3) 97.3 96.2 100 
3(6) 96.8 96.2 100 98.4 97.3 98.9 
4 95.3 95.1 
4(1) 96.2 97.8 98.9 
4(3) 95.2 95.2 100 
4(6) 94.1 94.1 100 96.8 96.8 99.4 
a] 95.2 94.8 
5(1) 95.2 96.8 98.9 
5(3) 93.1 94.7 100 
5(6) 93.1 95.7 100 94.7 95.2 99.4 





4. CONCLUSION 

In this paper a new method proposed for a multi-sample multi-instance palmprint recognition system. 
In the proposed method, the palmprint images samples that used for training were selected differently prior to 
the training process. The proposed method fixed two problems, the first one is the overfitting to the train data. 
While the second problem is the limited discrimination information in the unimodal biometric recognition 
systems. The results indicate that the rising of the discrimination information lead to an improvement in the 
recognition accuracy and restraining the noise. These solutions achieved without implementation for extra 
algorithms or sensors. 
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